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ABSTRACT 

At the root of the performance assessment movement is 
fairly widespread dissatisfaction with high-stakes multiple-choice 
tests. Many critics of multiple-choice tests argue that to improve 
instruction, tests themselves will have to improve. Hundreds of 
schools around the country are already experimenting with performance 
assessments, and many states are also experimenting with performance 
assessment. The states farthest along the road to performance 
assessment are Connecticut and Vermont. Vermont is the first state to 
use portfolios as part of a statewide assessment program. Among the 
advantages of performance assessments are the authenticity of what 
they say about what is assessed, the ways in which they offer 
students genuine intellectual challenges, and the opportunity they 
offer for restructuring schools. Problems with performance 
assessments are recognized in the areas of scoring, validity, 
instruction versus accountability, time constraints and teacher 
resistance, and cost. Those who decide to explore performance 
assessments can benefit from the experience of those who have already 
worked in this area. A supplement lists 27 resources for those 
interested in performance assessment, as well as organizations active 
in the field and state contact people. (SLD) 
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To supplement or replace multiple-choice tests, educators 
in many states are devising assessments that require students to produce 
rather than reproduce knowledge. 



To be promoted to Mount Olympus, Hercules had to complete 121abors. If he 
were a student in a contemporary American classroom,however, he would probably 
have to take a multiple-choice test to stake his claim to immortality, filling in tiny 
circles on an answer sheet instead of obtaining golden apples or rescuing sacred 
beasts. 

But in some states, Hercules would get to demonstrate hisability by performing 
tasks rather than answering multiple-choice questions. Such demonstrations, or 
performance assessments, are being touted not only as a better way to foster and 
measure student achievement but also as a lever for school restructuring. 

Frustration with Multiple-Choice Tests 

At the root of the performance assessment movement is a fairly widespread 
dissatisfaction with high-stakes multiple-choice tests. In this era of accountability, 
policymakers and the general public are clamoring for more and more information 
on student achievement. Because multiple-choice tests are easy to administer, 
relatively inexpensive, and seemingly objective, they are by far the most popular 
means of large-scale assessment 

However, such tests are beset with problems, say critics. ''Rather than being 
'objective' instruments, standardized tests often produce results that axe inaccurate, 
inconsistent and biased against minority, female and low-income students/' write 
Noe Medina and Monty Neill (p. 3). Whaf s more, critics declare, the tests are not 
a cred ible gauge of students' ability to solve problems, read, or write, much less to 
think creatively or work together. Finally, critics claim that multiple-choice tests 
corrupt the teaching process. Pressured to produce high test scores, teachers 
narrow instruction toward the limited skills targeted by the tests. 

Performance Assessment 

It is unrealistic to expect teachers to change instructional methods in the face 
of tests that deter such changes. Forinstructiontoimprove, the tests themselves will 
have to change, say many multiple-choice test critics. Thus, educators in some 
states are developing a new generation of tests called performance assessments, or 
authentic assessments, in which students demonstrate mastery by executing tasks 
or creating products rather than by selecting responses from a fixed menu. 
Performance assessments may involve: 

• Uniform tasks, such as timed essay questions or open-ended math problems, 
which students take statewide at a designated time. 

♦ Embedded tasks, or tasks incorporated into the classroom setting, which call on 
students to frame problems, conduct research, design science experiments, write 
reports, give presentations, converse with teachers, or make videotapes. 
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• Portfolios, or systematic collections of 
student work, including a wide range 
of finished products, early drafts, 
teachers' comments, and students' 
reflections on their own work. 
Portfolios enable teachers to evaluate 
student growth as well as discrete 
achievements. 

Uniform tasks are often developed by 
states or testing companies with input 
from teachers, whileembedded tasksand 
portfolios are often developed by teachers 
in collaboration with experts. 

To be sure, many teachers have 
always used some version of performance 
assessment in the classroom. English 
teachershavestudentswrite,for example. 
One of the purposes of the performance 
assessment movement is to get more 
teachers to do the same. But advocates 
look beyond the individual classroom 
toward school, district, and state 
assessments based on performance. They 
hope to devise tests that will meet the 
public's demand for accountability even 
while improving classroom instruction. 

Schools and Districts 

Hundreds of schools around the 
country are already experimenting with 
performance assessments. One of the 
most acclaimed is IPS Key School in 
Indianapolis, where videotapes of student 
projects form the principal means of 
evaluation. Schools belonging to the 
Coalition of Essential Schools, such as 
Walden III in Wisconsin and Central Park 
East in New York, require students to 
demonstrate mastery through an 
"Exhibition" before they graduate. 

In some cases, whole districts have 
committed to performance assessment. 
In Pittsburgh, for example, secondary 
school teachers, working with researchers 
from the Educational Testing Service and 
Harvard University, have developed a 
districtwideportfolio process called ARTS 

The views expressed in this publication are 
those of the author and do not necessarily 
represent positions of the Indiana Education 
Policy Center or its finders, the Lilly Endow- 
ment, Inc., and Indiana University. 

© 1992 Indiam Education Policy Center 



SAMPLE PERFORMANCE ASSESSMENT = 
What Does It Cost to Take a Shower? 
Part 1 : Getting Started by Yourself 

Think of five energy costs involved in taking a shower and identify th» information 
needed to estimate these costs. 

Part 2: Group Work • ^llll ' 

(a) How much energy and water Is needed to take a showor? Whatiril^ 
these components? Design and carry out a study to answer 'Ifniij^^^^^^ 
completely as possible. 

(b) Write a report of your investigation. Make certain you address aiicif^ 
on the Objectives Rating Form. 

(c) Present the results of your study to the class, including visual ma\9f^^^^^>m 
presentation clearer. 

Part 3: Finishing by Yourself 

Using the data from Part 2, compute the savings In a single showtf th^^ 
achieved by (a) reducing the rate of water flow by 30% and (b) lowering th« water 
temperature by 2°C. 



A task (abridged) from Connecticut's Common Core of Learning Performanp*:;^ 
Project, sponsored by the National Science Foundation. 



PROPEL to document student learning 
inwriting,music,andart. New standards, 
curricula, and teaching practices are being 
developed hand-in-hand with the 
portfolio process. 

Statewide Performance Assessments 

Many states are also experimenting 
with performance assessment. For 
example: 

• All 4th graders in New York take a 
hands-on science test, rotating through 
five stations and using real science 
equipment to solve problems. 

• California has added open-ended math 
problems to its 12th grade math test 
and a timed essay to its 8th and 12th 
grade language arts tests and is 
resolutely field testing new assessments 
in other subjects despite severe budget 
constraints. 

• Indiana (and over two dozen other 
states) also uses a timed essay as part of 
its statewide assessment program. 



• Arizona and Maryland conduct a two- 
day writing assessment that lets 
students prewrite and draft on the first 
day and rewrite on the second. Both 
states are also developing performance 
tests in reading/ math/ and other 
subjects. 

• Kentucky is committed to imple- 
menting a multi-subject/ multi-grade 
performance assessment system by 
1995. 

The two states farthest along the 
performance assessment trail are 
Connecticut and Vermont Supported by 
a $1 million grant from the National 
Science Foundation, Connecticut teachers 
and researchers arespearheadin^amulti- 
state compact of schools that assess 
sustained performance tasks in science 
and math. These tasks may take students 
weeks or months to complete and may 
involve group as well as individual work* 
Teachers evaluate not only the products 
that result but also student attributes 
such as curiosity and persistence. 
Connecticut's long-range goal is to 
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incorporate performance assessment into 
its statewide testing program. 

Vermont is the first state in the nation 
to use portfolios as part of a statewide 
assessmentprogram. Students in 4th and 
8th grade compile portfolii/i in writing 
and math. The Vermont Department of 
Education has established minimum 
requirements for portfolio content, but 
individual teachers have ample leeway 
as to what to include. Teachers grade 
their own students' portfolios based on 
state standards. To ensure consistency in 



Authentic assessments replicate 
the challenges that typically face 
businesspeople, scientists, or historians. 
—Grant Wiggins 



scoring, random samples of portfolios 
are selected for grading by a second 
teacher. If significant discrepancies arise, 
state-trained assessors adjudicate. 
Vermont's program also has two other 
components: a "best piece" chosen by 
each student and a uniform task 
administered statewide. 

Benefits 

Performance assessments are real, 
advocates say. "Authentic assessments 
replicate the challenges and standards of 
performance that typically face writers, 
businesspeople, scientists, community 
leaders, or historians," says Grant 
Wiggins (pp. 703-704). As such, they give 
direct evidence of higher-order thought 
processes rather than the circumstantial 
evidence provided by multiple-choice 
tests. 

Just as important as what per- 
formance assessments tell educators 
about students is what they do for the 
students themselves: offer genuine 
intellectual challenges. "Participation in 
authentic tasks is more likely to motivate 
students," says Fred Newmann (p. 460). 
Particularly in portfolio programs, 
students also learn to reflect on what they 
produce and to revise accordingly — 
habits of thought crucial to success in the 
real world. 

Finally, performance assessment 
may be a way to res tru ctur e th e sch ool s — 



in the reverse order of the current 
approach. As things stand, many 
reformers change the organizational 
structure of a school first and expect 
changes in curriculum and instruction to 
follow. However, "organizational 
changes alone will not modify long- 
standing conceptions of knowledge and 
deeply socialized habits of didactic 
teaching," says Newmann (p. 463). If we 
first decide what abilities students 
should acquire, then give them the chance 
to demonstrate those abilities, and fi- 
nally develop modes of organization and 
instruction to facilitate the demon- 
strations, "restructuring could form a 
new chapter in educational history" 
(p. 463). 

Problems 

Despite such glowing tributes, 
performance assessment is not without 
its share of problems. Among them: 

Scoring. Whatever bias may creep 
into the content of multiple-choice tests, 
the scoring is unyieldingly objective. 
Critics of performance assessment 
(including many parents) worry that 
evaluation of open-ended tasks will let 
subjectivity slip back into the grading 
process, opening a whole new Pandora's 
Box of unfairness and unreliability. 

Assessmen t d e velopers ha ve evolved 
a three-point strategy to address scoring 
concerns: 

• Formulate explicit, systematic criteria 
for tasks. 

• Make sure those rating the tasks are 
thoroughly trained. 

• Usemultipleratersandmonitorresults 
to ensure that criteria are applied 
consistently. 

Validity. Validity is a technical term 
referring to the extent to which a test 
actually measures what it is supposed to 
measure. Psychometricians use sophis- 
ticated techniques for increasing the 
validity of multiple-choice tests, but 
similar techniques have yet to be 
developed for performance assessments. 
"What should especially trouble policy 
makers," says Gregory Cizek, is that "we 



have begun a search for genuine-fooJtmg, 
authenHc-Ioofa'ng,real-IooJb'ngassessments 
and have eschewed more rigorous 
standards of validity" (p. 699). 

Multiple-choice tests may be superior 
in terms of technical validity, but that 
does not compensate for other flaws, says 
Dale Carlson, director of the California 
Assessment Program: "The narrowing of 
the curriculum that is taking place, 
inadvertently,as a result of our not [using 
performance assessments] is hi the long 
run more damaging than anj' statistical 
problems that may be occurring in the 
short run" (in Rothman, p. 10). Other 
proponents assert that the superiority of 
performance assessment in terms of 
"ecological validity" — fidelity to the 
overall goals of the curriculum — more than 
makes up for statistical deficiencies. 



We have begun a search for 
genuine-looking assessments and have 
eschewed more rigorous standards of 
validity. 
— Gregory Cizek 



Instruction vs. Accountability. It 
often seems that performance assessments 
with the greatest potential for improving 
instruction — portfolios and extended 
tasks — are least amenable to aggregate 
scoring (which enables comparison across 
schools), while performance assessments 
that are easiest to aggregate — uniform, 
state administered tasks — are the least 
likely to improve instruction. "There is no 
question that if we wanted to assess 
students for the purposes of improving 
the quality of instruction, we could do it 
tomorrow," says Wiggins. "The problem 
is, for a variety of reasons, aost people 
still want a technically rigid, comparative 
accountability system. Demanding that 
simultaneously is a real problem" (in 
Rothman, p. 10). 

Time and Teacher Resistance. 
Performance assessments can impose a 
severe burden on teachers accustomed to 
preparing students for traditional tests. 
Old-fashioned drill-and-practicemethods 
are inappropriate for authentic assess- 
ments, and many teachers have to revamp 
their classroom approach. Evaluating 
tasks also takes much more time than 
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simply grading answer sheets. "I thought, 
'Oh God, here's another thing they're 
throwing at us/" said one Vermont 
teacher (inBandler and Hoi combe, p. 15). 
Despite the new demands, though, 
teachers in Vermont and elsewhere, 
heartened by the enthusiastic responses 
of students, have f or tlv ,» most part reacted 
favorably to such programs. 

Cost Vermont has spent about 
$400,000 per year over three years for its 
writing and math program in grades 4 
and 8; Kentucky is spending $3.5 million 
during the first year of development of its 
multi-subject, multi-grade assessments 
and plans to spend up to $30 million over 
the next five years. It costs less than a 
penny for a computer to grade an answer 
sheet, versus 80* to $5.00 for people to 
grade a uniform task. In short, 
performance assessments are expensive 
to develop and score, although the costs 
vary widely depending on the type and 
extent of assessment. Whatever the cost, 
experts say thata big portion of it must be 
budgeted for teacher training and 
planning time, especially in programs 
that use sustained tasks or portfolios. 



Although expensive, performance 
assessment offers hidden benefits, 
proponents claim. It motivates students. 
It helps teachers refine standards for 
student achievement. And it provides 
some of the best staff development that 
teachers will ever get 

Suggestions 

If a state decides to explore 
performance assessment, policymakers 
can profit from advice given repeatedly 
by those who have worked in the field: 

• Totheextentpossible,involveteachers 
at every stagein theassessment process: 
development, scoring, revising. 

• Allow for necessary increases in staff 
development and planning time. 

• Start small, with one or two subjects in 
one or two grades, before moving to 
more extensive programs. 

Perhapsmostimportantly,saysJoan 
Baron, director of Connecticut's progi am, 



"Be patient/' Giving developers time to 
wrestle with still-existing problems may 
ultimately determine whether per- 
formance assessment is just another 
educational fad or a lever for genuine 
school reform. 
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Organizations Involved in Performance Assessment Research or Development 



Center for Research on Evaluation, 
Standards,and Student Testing (CRESST) 
Eva Baker and Robert Linn, Co-directors 
Contact person: Ron Dietel, Director of 

Communications 
UCLA Graduate School of Education 
405 Hilgard Avenue 
145 Moore Hall 
Los Angeles, CA 90024 
Phone: 310-206-1532 

Center on Learning, Assessment, and 

School Structure (CLASS) 
Giant Wiggins and Holly Houston, 

Directors 
39 Main Street 
Geneseo, NY 14454 
Phone:716-243-5500 

Council for Basic Education 
Ruth Mitchell, Associate Director 
725 Fifteenth StTeet, N.W. 
Washington, DC 20077 
Phone: 202-347-4171 



Educational Testing Service (ETS) 
Mary Fowles, Principal Measurement 

Specialist 
P-06 

Rosedale Road 
Princeton, NJ 08541 
Phone: 609-734-5228 

National Commission on Testingand Public 
Policy 

George Madaus, Executive Director 
McGuinn Hall, Room 531 
Boston College 
Chestnut Hill, MA 02167 
Phone:617-552-4521 

Northwest Evaluation Association 
Allan Olson, Executive Director 
5 Centerpointe Dr., Suite 100 
Lake Oswego, OR 97035 
Phone:503-624-1951 



PACE (Performance Assessment 

Collaboratives for Education) 
Dennis Palmer Wolf, Director 
8 Story Street 
Cambridge, MA 02138 
Phone: 617-496-2770 

Portfolio Assessment Clearinghouse 
Winfield Cooper and Jon Davies, 

Co-directors 
c/o San Dieguito Union High School 

District 
710 Encinitas Blvd. 
Encinitas, CA 92024 
Phone: 619-753-6491 

Student Assessment Consortium 
Council of Chief State School Officers 

(in collaboration with CRESST) 
Ed Roeber, Director 
One Massachusetts Avenue, N.W., 
Suite 700 

Washington, DC 20001-1431 
Phone: 202-408-5505 



Stat* Contact People 

Joan Baron 

Common Core of Learning Assessment 

Coordinator 
Connecticut State Department of Education 
165 Capital Avenue, Room 340 
Hartford, CT 06106 
Phone: 203-566-5454 

(Note: A comprehensive monograph on 
Connecticut's progress in developing 
performance assessments is scheduled for 
publication in the summer of 1992.) 



Dale Carlson 

Assistant Superintendent of Public 

Instruction 
California Assessment Program 
California Department of Education 
PO Box 944272 
721 Capitol Mall 
Sacramento, CA 95816 
Phone: 916-657-3011 



Geof Hewitt, Writing Consultant 
Robert Kenney, Mathematics Consultant 
Vermont Portfolio Assessment Program 
Vermont Department of Education 
120 State Street 
Montpelier, VT 05620 
Phone: 802-82^-3111 



Indiana Education Policy Center 
Bloomington Office 
School of Education 

Suite 326 
Indiana University 
Bloomington, IN 47405 
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